A corpus study of clause combination
نویسندگان
چکیده
We present a corpus-based investigation of cases of clause combination that can be expressed both through coordination or with subordination. We analyse the data with a two-step computational model which first distinguishes subordination from coordination and then determines the direction for cases of subordination. We find that a wide range of features help with the prediction, notably frequency of predicate participants, presence of adjuncts and sharing of participants between the clause predicates.
منابع مشابه
Application of Clause Alignment for Statistical Machine Translation
The paper presents a new resource light flexible method for clause alignment which combines the Gale-Church algorithm with internally collected textual information. The method does not resort to any pre-developed linguistic resources which makes it very appropriate for resource light clause alignment. We experiment with a combination of the method with the original Gale-Church algorithm (1993) ...
متن کاملEffects of structural prominence on anaphora: the case of relative clauses
In this paper we present a corpus study and a sentence completion experiment designed to evaluate the discourse prominence of entities evoked in relative clauses. The corpus study shows a preference for referring expressions after a sentence final relative clause to select a matrix clause entity as their antecedents. In the sentence completion experiment, we evaluated the potential effect of he...
متن کاملHabeas Corpus and Due Process
The writ of habeas corpus and the right to due process have long been linked together, but their relationship has never been more unsettled or important. Following the September 11, 2001 attacks, the United States detained hundreds of suspected terrorists who later brought legal challenges using the writ. In the first of the landmark Supreme Court cases addressing those detentions, Hamdi v. Rum...
متن کاملRobust clause boundary identification for corpus annotation
The paper describes a rule-based system for tagging clause boundaries, implemented for annotating the Estonian Reference Corpus of the University of Tartu, a collection of written texts containing ca 245 million running words and available for querying via Keeleveeb language portal. The system needs information about parts of speech and grammatical categories coded in the word-forms, i.e. it ta...
متن کاملClause Complexity in Applied Linguistics Research Article Abstracts by Native and Non-Native English Writers: Taxis, Expansion and Projection
Halliday’s Systemic Functional Linguistics (SFL) has stood the test of time as a model of text analysis. The present literature contains a plethora of studies that while taking the ‘clause’ as a unit of analysis have put into investigation the metafunctions in research articles of a single field of study or those of various fields in comparison. Although ‘clause complex’ is another unit of SF a...
متن کامل